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In this Issue 

^^^^^^^^^H A basic computer system has a central processor, some main memory, 
^^^^^^^^^^1 and some mass memory. The mass memory is slower than main memory 

1^^^^^^^^^^ but has much more capacity. Recent trends are lor processors to get faster, 
and for semiconductor main memory to gel not only faster but atso cheaper. 
SS^ ^'^ computer systems have more of it. Disc memory mass storage systems 
jp^ ^3ve grown in capacity, but disc access times haven't changed much. The 

result IS that m many systems, fast processors and main memories are 
underemployed, slowed down by the disc drives they have to work with. On 
page 21. two HP computer scientists report on their research into this prob- 
lem, the various possibilities for alleviating it, and the solution they chose for the HP 3000 family 
of computers. Their answer is disc caching— temporarily storing in excess main memory the disc 
files that are most likely to be needed soon. This way the processor needs to access the slow 
discs much less frequently. It's the same principle that hasled to the useof very fast semiconductor 
cache memories between the processors and the mam memones m many high-performance 
computer systems. And it does work — an HP 3000 with disc caching completed a benchmark 
program (aster than another computer that had aprocessorfive times faster but no disc caching. 

On our cover this month is a photograph of a magnetostatic-wave delay-line filter, fvtagnelostalic- 
wave devices are being studied by HP's central research laboratories for applications in delay 
lines, couplers, signsl-to-noise enhancers, filters, and oscillators for the microwave frequency 
range, roughly 0.5 to 26.5 gigahertz. These devices depend on magnetoslatic waves propagating 
in thin films of ferrimagnetic materials. Although they've been studied since the 1 950s, they didn't 
begin to show attractive performance until better methods for growing high-quality thin films were 
discovered in the 1970s. On page 10, two HP Laboratohes scientists review the basic principles 
of these devices and report on recent results indicating that magnetostatic wave devices may 
soon be ready for large-scale commercial application. 

HP 9000 Series 200 Computers are widely used as personal workstations for engineers. The 
article on page 4 describes a document editing software package for these computers that makes 
it much easier for engineers to produce such documents as engineering notes, product change 
descnptions, and design specifications. HP TechWriler is compatible with the Pascal 3.0 operating 
system and with a wide range of printers, including HP's LaserJet printer. Text and graphics are 
electronically merged and appear on the computer screen as they will in the printed document. 

-R.P. Dolan 



What's Ahead 

In March, we'll have articles on two tape-drive designs. One drive is a 'A-inch carthdge tape 
unit that can back up a 60-megabyle Winchester disc in 30 minutes. The other is a high-perfor- 
mance, low-cost, dual-density Ys-inch unit for backing up systems that have 400 megabytes or 
more of storage. We'll also have an article on HP Maintenance Management, an equipment-mainte- 
nance software package for HP 3000 Computers. 
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HP TechWriter: Illustrated Documents for 



Engineers 

This document editing software package for l-iP 9000 Series 
200 Computers eiectronicaliy merges text witfi pictures from 
many IHP grapfiics software pacl<ages. Text and graphics 
appear on the screen as they will in the printed document. 

by Elaine C. Regelson and Roy E. Anderson 



HP TECHWRITER is a documenl editing software 
product that assists engineers in producing illus- 
trated documents, as shown in Fig, 1, It runs on HP 
9000 Series 200 Computers in ihe Pascal 3.0 operating sys- 
lem. In addition fo providing illustrations, HP TechWriter 
offers significantly enhanced document support com- 
mands and numerous hard-copy options. It was recently 
enhanced to support larger pictures and more hardware, 
including the HP LaserJet printer. 

HP TechWriter consists of three original programs [Fig. 
2), The Editor is used for text entry and editing and produc- 
ing listings with draft-nuality pictures. The Lister produces 
listings of multiple files with final-quality pictures and 
generates tahles of contents. The Picture Processor converts 
plot files produced by other programs into picture files 
suitahle for illustrating documents. 

Engineers, scientists, and first-level lochnicai managers 
are the target users for HP TechWriter, These people need 
the ability to include drawings, charts, and sketches in 
tlieii' documents. Our analysis of these users indicates that 
they are willing to spend some time learning this type of 
product, but they insist that it be very responsive (fast), 
and thfll it be easy to use once they learn il. 

There was a text editor already available on HP's Pascal 
workstations that had some of these qualities, but not the 
needed functionality. II proved possible to start the de- 
velopment of the HP TechWriter Editor by leveraging Ihis 
editor. The Pascal workstation editor is very fast al lend 
display and scrolling. Most engineers master it in about a 
day of intensive use, and experienced users find it quite 
easy to use, with little overhead in the commands. This 
made it a logical foundation for HP TechWriter, 

HP TechWriter allows users to produce most internal 
publication-quality documents, such as engineering notes, 
product change descriptions, and design specifications. It 
also allows users to take advantage of any capabilities of- 
fered by their printers, such as math, bold, or italic fonts. 
The new features required major enhancements to the Pas- 
cal workstation editor, 

HP TechWriter Features 

The HP TechWriter Editor retains the basic qualities and 
speed of the standard Pascal workstation editor, and the 
data files of the two products are fully compatible. The HP 
TechWriter Editor offers an extended set of features, allow- 



ing the user io do basic editing, to set and use the global 
editing environments, to specify illustrations, to tailor hard 
copy, and Io tormal documents. There are two kinds of 
command interfaces. Global commands are entered from 
the keyboard and take effect immediately. Document, or 
embedded, commands are inserted into the file, and affect 
the text following them. 

HP TechWriter provides basic editing functions includ- 
ing insert, delete, change, copy, find, and replace text, 
produce a hard-copy listing of the document, and save, 
retrieve, and copy from text files. The user can scan and 
work ihrough the entire text file quickly by using arrow 
keys, or a knob if it is available. The user can also move 
by screenfuls of text, or to specific marked locations in the 
file. In addition. HP TechWriter offers several features use- 
ful for document text preparation, such as paragraph filling 
and word wrap, ragged and straight right-edge justification, 
flexible line adjustment (center, right, anywhere), and the 
abilily to specify and change margins easily and override 
global margins whenever desirable. The Editor stores the 
most recently deleted text to allow the user to move text 
or recover from an unintentional deletion. 

Embedded commands let the user force new pages, 
specify headers and foolers, mark table of contents entries, 
set spacing, and turn off listing of particular areas of the 
document. A reminder-level help facility lists all command 
names and options and states very briefly how they work. 
This is generally all the prompting that is needed to support 
full use of all features. 

There is a global environment tor each document that 
defines standard margins and the text entry mode. The 
document mode results in word wrap, while program mode 
preserves all entries as typed. The global environmenl also 
defines special characters including the command charac- 
ter, which is used lo mark command lines embedded in 
the file. The default command character, , is used in all 
examples and illustrations in this article. In addition to 
the global environment, both the Editor and the Lister use 
a list enviroTunent. which defines the printer type and the 
print destination and includes parameters specifying the 
position of the print on a page. 

The user can tailor HP TechWriter by setting default 
values for all parameters in either environment in a special 
data file called the configuration file. This file also allows 
the user to enter all text that appears in any hard copy (the 
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default footer, table of contents column headings, etc.). The 
user can also specify the format for the dale [month/day ' 
year, day-month-year, etc.) printed in headers or foolers. 

Real-Trme Margi nation 

HP Tech Writer recognizes paragraphs when it is in docu- 



ment mode, and formats them according to their margins, 
either when their entry is completed or by explicit com- 
mand. Margins are specified in the global environment and 
may also be specified by local commands. The user can 
specify ragged or straight right margins. Document com- 
mands can also be used to turn off margining for a region 



HP TechWriter is an engineering documentation product that 
provides Series 2 00 customers with a version of the familiar Pascal 
text editor that has been enhanced to allow customers to create and 
print illustrated documents. The illustrations are contained in plot 
files produced by graphics editors such as HP EGS/200 or by customer 
programs . 



HP EG5/200 drawing : 




Pictures are drawn scaled to a size specified by the picture 
command. If you do not need to see the pictures, HP TechWriter allows 
you to turn off the display of your figures if you wish to. The editor 
also provides a number of convenient features for developing documents, 
such as right justification, multi-line headers and footers, and 
automatic table of contents generation. 



VinQ 



■ Sluf 



IJI XiSJL 



side by side text and graphics 
allows you to produce annotated 
pictures on any printer which can 
support this feature. Local margin 
commands allow you to use special 
margins for quotation text or the 
text describing your picture, or to 
turn margins off completely for 
tables. 



This sample was produced on a LaserJet printer (HP 2ea6A) showing 
its variable resolution graphics capability - the top picture is at 150 
dots per inch while the lower picture is at 75 dots per inch. 

Fig. 1 . A sample o' an HP TechWtiler document The user sees the documani on the display 

as it appears m this hard copy 
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Fig. 2. HP TechWriter consists of three programs It accepts 
picture liles liam many HP graphics software products and 
works with most HP printers that have graphics capability. 

of the document. Turning off margination with a margin 
document command is a feature that is used to protect 
tables and similar text from being rearranged by the margin- 
ing operation. 

When margining text into a paragraph, HP TechWriter 
first checks to ensure that margining has not been turned 
off for that paragraph. After verifying that margination of 
the current text is allowed, the text ahead of the paragraph 
is scanned for a document command specifying local mar- 
gins. If an appHcahle margin command exists, margin val- 
ues are ohtained from it. Otherwise, the global environment 
margins are used. 

Each line of the new paragraph is then constructed word 
by word, with single spaces separating words regardless 
of how many spaces were originally typed. When a word 
is encountered that exceeds the right margin, a new line 
is started. When the last word of the paragraph is reached. 



Ihe margining operation is finished if a ragged right margin 
was specified. If a straight right margin is indicated, spaces 
are added between the words of each line until the ends 
of the lines align with the right margin. To avoid having 
paragraphs ihat luuk mure dense on one side than the other, 
HP TechWriter inserts the needed hianks from alternating 
ends of the lines, distributing them across the line, ia an 
"even" manner, but with priority given to words that end 
with a punctuation mark. Margining is very fast. An average 
paragraph is remargined in about 200 millisecnnds. 

Illustrating Documents 

Ulustralioiis used in ducuments may. at Ihe user's option, 
be displayed on the screen while editing the document. 
Using this, the user actually views the picture while writing 
about it and can see how the picture will look next to the 
text before producing the hard copy (Fig. 31, 

To illustrate documents, the user must first produce a 
plot file of a picture from a graphics editor such as HP 
EGS/200 or HP Graphics Presentations/200, or from pro- 
grams written by the user in BASIC 3.0 or Pascal 2.1/3.0. 
This file must be processed using the Picture Processor, 
The Picture Processor reads the Hewlett-Packard Graphics 
Language (HP-GL) commands in the plot file, finds the 
limits of the drawing for ease in scaling the illustration, 
and writes a binary file containing the move, draw, and 
linetype data from the plot file. A binary file is used be- 
cause the Editor can read it much faster than i! can read 
the original ASCII file. 

HP TechWriter is one of very few products that displays 
text and pictures simultaneously for the user. High-speed 
picture display is vital to making this feature useful. To 
include a processed picture in the document, the user en- 
ters a drawing command as shown in Fig. 4, 

The drawing command line contains the file name of the 



Fig. 3. An example ot an HP 
TechWriter screen. 



>THBdit: Adjuit Copy Dalita Find Inttrt Juap RapUca Quit Xchange T 
HP TachHritar lata you actuaDjr vlaw jreur plcturai en your acraan ahlla 
you arile about thea. 
"D F-ED:hubd, L-22, R-62 



SIMPLE driHtng: 



"D F.ED:eircuit, L-B, R'38 



Vin 




Sida by side text and graptiict 
al lava yeu te produce annotated 
pleturei on any printer nhtch can 
aupport thli faatura. Local Mrgin 
coaHnda alia* you to u«e apectal 
aargin* for quotation teit or the 
text deicrlblng your puturs, or te 
turn aarglna off coapleteiy for 
tablea. 



6 HBMLm-PACKARD JOURNAL FEBRUAHY 1985 



© Copr. 1949-199B Hewlett-Packard Co. 



processed picture file and a specification for the left and 
right picture edges in columns. The drawing is completed 
by the next command line thai appears in the file. The 
picture is draivn between ihe drawing command hne and 
its terminator (the next command line) and between the 
right and left margins. The picture in Fig. 4. for example, 
is eight lines high and is drawn het ween columns 0 and 15. 

Plot Files 

HP Tech Writer was originally conceived as a companion 
product to ihe HP Engineering Graphics System (HP EGS, 
200] to allow users of that system to produce documents 
illustrated with their drawings. Using the HP EGS/200 file 
structures, however, would have required a special in- 
terpreter to read those files, and would have limited HP 
TechWriter to illustrations produced by HP EGS/2D0. HP- 
GL plot files were ultimately chosen as the picture input 
file for HP TechWriter illustrations because they can be 
produced easily by any program that drives a plotter. The 
standard BASIC 3.0 and Pascal 2.1/3,0 (DGL) operating sys- 
tems for HP 9000 Series 200 Computers produce these plot 
files. This allows the user lo choose from a wide variety 
of currently marketed sources for document illustrations 
including, in addition to HP EGS/200, HP Graphics Presen- 
tations/ 200, HP Statistics Library/200. Graphics Editor/200, 
and Data Grapher/20n- 

HP TechWriter interprets only the limited subset of HP- 
GL commands produced by the standard BASIC and Pascal 
systems. These commands |PU, PD. LT, SP, and PA) produce 
consistent results on all HP plotters. 

Text and Graphics Display 

To maximize Ihe size of a document that can be handled 
by the Kditor, a drawing is read in from the disc and re- 
drawn each time il appears on ihe screen. The process of 
reading and drawing the picture is relatively slow com- 
pared to Ihe rest of the editing process. HP TechWriter 
minimizes the number of times Ihe picture is redrawn by 
scrolling Ihe picture when possible and keeping Irack of 
ils location, redrawing il only when necessary. If there is 
no need to see the drawing al a particular time, the user 
can turn the picture draw option off, eliminating Ihe time 
needed for redrawing. 

Most Series 200 Computers have screens with two planes 
for display, alpha for the text and graphics for pictures. 
Either plane can be turned on or off independently. To 
display environment or help screens and then return lo 



the main text display, the Editor turns off the graphics 
plane and redraws the alpha plane to show Ihe desired 
display. 

The t^vo independent planes present a significant prob- 
lem when trying to scroll pictures. Since the information 
for draiving a picture is not stored, pictures scrolled off 
the screen could lose tops or bottoms when scrolled back 
on. HP TechWriter solves the problem by scanning the 
displayed section of text and displaying only pictures thai 
completely fit on the screen. When scrolling, the program 
first shifts the alpha plane one line, then shifts the graphics 
plane one alpha line (B-12 graphics lines). Assembly lan- 
guage procedures were written to shift graphics display 
lines quickly up or down a specified number of lines, and 
to erase only selected graphics lines. These special routines 
result in smooth and fairly fast scrolling for al! Series 200 
Computers, although moving the graphics plane of the 
Model 23BG's color display is sJow enough that a mild 
"ripple" effect is visible while scrolling pictures. 

The HP 9000 Model 237 display is a high-resoiulion 
bit-mapped display that represents a departure from the 
separate alpha and graphic planes architecture, since il 
makes no distinction between alpha and graphics display 
information (it is all graphics). As a resuU. it represents 
both a complication and a simplification for HP TechWriter 
lo .support. On one hand, scrolling becomes a single oper- 
ation vrith no distinction between scrolling alpha and 
scrolling graphics, but on the other hand, displaying text 
over a picture and switching to alternate HP TechWriter 
displays is more difficult. 

Instead of turning off the graphics display, as is done on 
the other Series 200 Computers. HP TechWriter erases the 
Model 237 display. However, the Model 237's bit-mover 
hardware makes this operation very fast. The unavoidable 
result is that the equivalent of "turning on" the display 
now involves redrawing the picture from disc. 

Displaying text over a picture on most Series 200 Com- 
puters is simple, since there are two display planes. For 
the Model 237, the replacement rule register is used to 
cause figures to be ORed as they are drawn with any existing 
text displayed on the screen. Again, the result is a high-per- 
formance integration of alpha and graphics information 
displayed by the Editor. 

Printing Side-by-Slde Text and Graphics 

HP TechWriter is exceptional in ils ability to print out 
pictures with, next to, or even on top of the text in the 



D F-hubd L-0 R-15 



Any text entered between the picture command and its 
terminator is considered part of the drawing. It is 
positioned next to the picture when the document i s 
printed just as it was on the screen. 




Fig. 4. To include a drawing in Ihe document, the user er)ters a drawing command as shown 

on Ihe top line in lhl& hard copy 
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HP TechWriter Security 

HP TechWriter uses a software security rnectianisrn developed 
for HP EGS/200 The mechanism is designed to allow program 
copyrng, buL !o limil full use ot the program to Ihose who have 
purchased ihe software. Ttie mechanism also permits HP sales 
personnel to demonslrate Ihe program on any avarlable machine. 

The HP TechWriter package includes a nghi-to-use cerlificaie, 
which has a serial numt^er and instructions for contacting Hewlelt- 
PackarO lo obtain a code word. The codeword provided is based 
on the certificate number and the model and serial numbers 
contained in the ID PROM of Ihe user's HP 9OD0 Series 200 
Computer, and is valid only for lhal machine 

When HP TechWriter is executed, the program looks for a code 
\M)rd in the contiguraiion file It no code word is found or the 
code word does nol agree wilh what the 10 PROM model and 
serial numbers indicaie i| should be, Ihen the program will not 
allow disc wMte operations. By allowing HP TechWriter to execute 
in either case, the functionality of the program, including reading, 
modifying, and listing existing disc files, can be easily de- 
monstrated Some example text and pictures tiles are provided 
with the product for this purpose However, it is not possible to 
save the results of the Editor on disc, so it is difiicult lo do useful 
work when the correct code word is not present. 

HPTechWriier allows one contiguralion tile !o be used by more 
than one computet by allowing multiple code word entries in the 
configuration file This means that a shared resource disc or a 
removable disc volume may be used by more than one computer 
using HP TechWnter by including a codeword for each computer 



document on a wide range of printer types. Several teclmi- 
cal problems were solved to make this possible. Different 
printers have a variety of resolutions, features, and inter- 
faces to those features. The HP Printer Command Language, 
HP-PCL, is now slandardi:/ing feature interfaces, but HP 
TechWriter has to work with existing printers. The variety 
□f printers results from advancing technology and the wide 
differences in cost and uses for printers. We chose to sup- 
port current printers lo make the product useful for our 
present as well as future users, and to offer users a range 



of options in terms of printer speed, price, quality, and 
type. Supporting any one printer would have been much 
easier, but would nol have offered users this range of op- 
tions. The few printers qualified on the HP 9000 Series 
200 that are not supported by HP TechWriter generally do 
not offer graphics and have feature interfaces thai are rad- 
ically different from the HP-PCL definition. 

To produce a rough hard copy ot a picture, HP TechWrit- 
er uses the Pascal library Device-independent Graphics 
Language (DGL) to plot the picture to graphics memory. 
This lakes advantage of that product's fast vector-to-raster 
conversion. HP TechWriter then reads the screen memory 
and dumps raster rows lo the printer. Since there are several 
ways in which graphics memory is organized among the 
various HP 9000 Series 200 Computers (ranging from a bit 
per pixel to a byte per pixel), assembly language procedures 
are used to read whichever graphics memory is in use and 
construct the one-bit-per-pixel representation required by 
all supported printers. 

To produce side-by-side pictures and text, HP TechWriter 
writes the alpha line without advancing the paper, then 
writes one alpha line equivalent of graphics rows to the 
printer. Side-by-side text and graphics requires a printer 
that can write alpha lines without advancing the paper. 

Pictures are positioned and sizbd in the document based 
on screen text columns and lines. The HP 9000 Series 200 
Computer display screens vary in their graphics-dots-to- 
alpha-cell ratio, and no printers match any of the graphics- 
to-alpha dot ratios of the screens, Therrafore, pictures pro- 
duced this way tend to look different on the screen and in 
hard copy, as shown in Fig, 5. 

In general, the printers have narrower and taller character 
cells than the HP 9000 Series 200 displays. To maintain 
the aspect ratio of the picture as specified in the document, 
pictures generally print short in the X direction and long 
in the Y direction wilh respect to the surrounding text. 
The exact discrepancy, of course, depends on which printer 
and display are in use. Some match the height ot the ceil 
in dols exactly, and others may be short in the Y direction. 

HP TechWriter dealt with this problem initially by com- 
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Fig. 5. Because the ratio of graphics dots to alpha dots vanes from display to display and 
trom printer to pnntef. text may look different on the screen flett) and in hard copy (right), 
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Fig. 6. Using an escape character (~ in this case), the user can take advantage ol all of the 
prmler's features Here !he input snown in (a) produces the formula shown in (b) 



puling the difference in resolution between the printer and 
the screen in each dimension, then allowing the user to 
specify what to do with the difference. The Editor still 
produces pictcces in this way, providing draft-quality pic- 
ture hard copy. The recently upgraded version of the Lister 
goes one step farther; it plots the picture using the correct 
horizontal and vertical character cell size in graphic dots 
for the particular printer in use. then prints that informa- 
tion. Pictures printed from the Lister use all of the graphics 
resolution available from the printer, and can be larger in 
both dimensions than pictures printed by the Editor, These 
larger, printer- density pictures are called "final-quality," 
and look as good as the printer can produce. 

User Ingenuity 

Every printer is different and every engineer has different 
specifications for documents. HP TecbWriler ikies nol at- 
tempt to address all combinations. It provides a minimum 
funclionality, then gives the user the ability lo program 
extended functionality into the document. 

For instance, the spacing command was introduced to 
allow production of double-spaced documents. The spac- 
ing command lots the user set any spacing desired, includ- 
ing zero spacing (i.e., overprint]. By using this feature, the 
user can produce pseudobold output or special combina- 
tion characters. 

The most powerful feature of this Idnd is the escape 
character. The e.scape character packs the throe digits im- 
mediatelv following it into one byte to send to the printer. 
Using the individual printer *.s manual, the user can take 
advantage of all features provided by the printer. For exam- 
ple, if the escape character is a the sequence shown in 



Fig. 6a causes an HP 2934A Printer lo print the formtila 
shown in Fig. 6b. 

Briefly, the printer is instructed to select a math character 
set as the alteniale character set by the command -027]IM. 
The ■ 015 directs use of the normal character set, and -014 
specifies the alternate character set. The -027= forces a 
half-line line feed. Note that this example uses half-line 
spacing since the HP 2934A Printer does not have subscript 
or superscript commands. Half-line spacing moves the 
paper up while spacing is turned off ( s n=0|. thereby 
causing a mismatch between the line-on-page count and 
the actual Une on the page. This will cause the footer to 
be misplaced on the page. To reposition the footer correctly, 
either the page could be half-line spaced up again, or the 
number of lines per page for the printer could be doubled 
for one line and a blank line printed. Printing two lines at 
twice the lines per page in this way would result in one 
blank line on the output and would correctly reset the 
line-on-page count. 
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Magnetostatic-Wave Devices for 
Microwave Signal Processing 

By locally perturbing the magnetic dipoles formed by 
spinning electrons in thin ferrimagnetic films, a propagating 
wave can be initiated. Devices based on this principle can 
be used to process microwave signals- 



by Waguih S. Ishak and Kok-Wai Chang 

MAGNETOSTATIC WAVES propagating in a thin 
film of ferrimagnetic material pruvide nn allractive 
means for processing signals in ihe ().Fi-lo-26,5-GHz 
frequency range. In addition to llieir application as tunable 
delay lines, couplers, and signal-to-noise enhancers, tiie 
simple fabrication steps and the excellent performance of 
magnetoslatic-wave (MSW) devices makes them very com- 
petitive with other microwave device technologies such as 
yttrium-iron-garnet (YIG) spheres in such applications as 
filters and oscillators. This report reviews some of the basic 
principles of MSW device technology and describes the 
current state of the art and some of its applications. 

History and Current Status 

Early oxperimenls wilfi magnetoslatic waves in bulk YIG 
material were begun in the late 1950s to demonstrate tun- 
able microwave delay lines for use in pulse compression, 
frequency translation, and parametric amplifier circuits. 
However, these devices did not reach product engineering 
status because of basic material problems such as the 
nonuniform internal magnetic field inherent in all nonel- 
lipsoidal ferrimagnetic shapes' such as rods and bars. The 
bulk delay lines demonstrated at that time had more than 
30 dB of insertion loss and exhibited very limited dynamic 
range. 

With the advent of liquid-phase epitaxy techniques tor 
growing high-quality, single-crystal YIG films linilially de- 
veloped for the fabrication nf magnetic bubble memory 
devices), a renewed interest in MSW devices started in the 
mid 1970s. Since thin [1 to 100 ;j,m) YIG films can be grown 
now with less than one defect per square inch, and because 
these films exhibit a uniform internal magnetic field 
throughout most of their cross sections, which reduces 
losses, magnetostatic-wave delay lines are presenOy being 



I 




Fig. 2. Graph ot areas of operation tor propagation of mag- 
netostatic volume and surface waves in thin V/G liims. 

built with less than 5 dB of insertion loss at 10 GHz. Fur- 
thermore, the thin-film geometry lends itself to the use of 
integrated circuit manufacturing techniques, resulting in 
high device yield and excellent repeatability of perfor- 
mance.^'^ 

MSW Propagation 

Magnetostatic-wave propagation in thin ferrimagnetic 
films has been extensively considered. Three modes — mag- 



Microstrlp Ttanaducers 





Ffg.1. Three magnsfostatic wave 
modes as determined by the 
orientation ol the propagation vec- 
tor s and the direction ol the exter- 
nal magnetic bias field ' <aj 
Surface wave mode. <b} Back- 
ward volume wave mode, fcj For- 
ward volume wave made 
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nelostatic surface waves, backward volume waves, and for- 
ward volume waves— are of inierest tor device applica- 
lions. These modes are determined by the orientations of 
the external magnelic bias field H^, and ihe propagation 
direction s. as shown in Fig. 1 . These modes are dispersive 
and characterized by a narrow propagation passband and 
limited magnetic bias field tunabilrty. The latter property 
can be used to choose a particular wavelength at any par- 
ticular frequency in the passband. Table 1 summarizes the 
important dispersion characteristics of these three modes, ^ 
where: 

0 — wave number = 2 ti/ wavelength = ZttfK 
4ttM = YIG saturation magnetization 
7 - gyromagnetic ratio = 2.8 MMz/'oersled 
H, = internal field = Hf, + H^i„,„py 

The potential bandwidth of these MSW devices is given 
by luji - u),. where ui,, and tO| are as defined in Table I. For 
example, these expressions predict a bandwidth of more 
than 2.2 GHz at a cenler frequency of 10 GHz for a ma^eto- 
static forward volume wave device. 



As seen from the oi-0 dispersion diagrams in Table 1, the 
group-del ay- versus-frequenry characteristics of these three 
MSW modes are highly nonlinear. Fortunately, it is possi- 
ble to modify these dispersion characteristics by introduc- 
ing an external conductive plane in the vicinity of the YIG 
film or by growing a multilayer structure of a YIG film on 
3 gadohmum gallium garnet (GCG] substrate. The separa- 
tion between the YiG film and the conductive plane and'or 
the different thicknesses of Ihe layers in the multilayer 
structure are parameters that can be controlled to determine 
the dispersion properties of these devices. 

Propagation lasses of MSW devices are proportional to 
the ferrimagnetic resonance linewidth AH. These losses (in 
dBlfis) are given approximately by 76.4 iH. where AH is 
in oersteds. Because AH increases approximately linearly 
with frequency, the propagation loss increases in a like 
fashion with frequency. I^esently, pure YIG films are grown 
with AH less than 0.5 oersted at 10 GHz, resulting in a 
propagation loss of less than 36 dBifis for X-band delay 
lines. 

The various properties discussed above can be simi- 
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Magnetostatic Waves fn Thin YIG Films 
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Magnetic Resonance and YIG-Sphere Devices 



Magnetic resonance is a phenomenon found in magnetic sys- 
tems lHal possess both magnetic moments and angular momen- 
tum According lo classical theory, an atomic magnetic moment 
behaves as a gyroscope in the presence of an external static 
magnetic field. The term resonance implies thai the atomic mag- 
netic moment is in tune with a natural frequency of the magnetic 
system In this case, the frequency corresponds to the rale ol 
gyroscopic precession of the atomic momeni about the direction 
of the external field. 

Resonance occurring in ferrimagnelic materials such assingle- 
crysial yttnum iron garnet (YIG) was first studied by C Killel in 
1948 Since then it has been used to determine several material 
parameters such as relaxation lime, crystal anisolropy and mag- 
netostriclion In 1957. Dillon observed very low ferrimagnetic 
resonance Itnewidlh values for single-crystal YIG spheres and 
found lhat impunties, especially relaxing ions, and surface rough- 
ness Increase linewidth considerably because of the excitation 
ol spin waves. 

Fig. 1 shows an alom with its associated magnetic moment 
created by a spinning electron The magnetic moments of the 
atoms in each molecule of a YIG crystal combine to form a strong 
magnetic dipole Application of an external magnetic field Ho 
causes these magnetic dipoles to align themselves in the direc- 
tion of the field This produces a strong net magneti?ation M 
Any magnetic lorce acting at tight angles to the bias field 
results in precession of the dipoles around the field at a rate that 
depends on Ihe strength of the field and, to a limited degree on 
the basic properties of the material Such lateral forces may result 
from RF magnetic fields orthogonal to Hq. If the frequency of the 
RF tield coincides with the natural precession frequency of the 
magnetic dipoles m the matenal. a strong interaction resuHs. 

For spherical YIG resonators, the resonance frequency is given 
by; 

f. = v(Ho 4- MHz 



marized in a graph defining the area of operation for differ- 
ent magnetostatic wave modes. Fig. 2 shows the frequency 
range of YIG-film magnetostatic volume and surface wave 
mode devices as a function of the internal field H,. 

MSW Delay Lines: How They Work 

Fig, 3 shows a YIG film grown on a GGG substrate and 
placed in an external magnetic bias field H^, The magnetic 
moments created by Ihe spins of Ihe bound electrons in 
the YIG film pcecess around the direction of the bias field 
in a manner analogous to the behavior of a g\'roscope in a 
gravitational field. The gyroscope corresponds to the spin- 
ning electron and the force of gravity corresponds to the 
external bias field. Because these magnetic moments are 
coupled, a local perturbation caused by a current- carrying 
conductor (input transducer] creates a disturbance [wave] 
that propagates through the YIG film in a manner similar 
to that shown in Fig. 3b, When this magnetic disturbance 
reaches another conductor (output transducer] sometime 
later, it induces a current in the conductor similar to that 
initiating Lhe disturbance, thus forming a simple delay line. 

The generation and propagation of the magnetic distur- 
bance can be likened to plucking a guitar string, which 



M 




Fig. 1, ;n a crysla/ of ferrimagnelic materiai. the spMng 
eiecirons arculating around the nuclei of the atoms forming 
the crystal lattice generate a strong magnetic moment M that 
processes around the direction ol an external magnetic Held. 

where Ho is in oersteds, Hg is tfie anisiropy field in oersteds, and 
■y IS the gyromagneiic ratio (charge-to-mass ratio of an electron), 
which has a value of 2.8 MHztoersted For more than two de- 
cades YIG spheres have found applications as practical micro- 
wave components such as bandpass and bandstop filters and 
resonators tor oscillators Because the resonance frequency can 
be changed by varying the strength of the bias field, YIG-sphere 
devices are widely used in such applications as swept receivers, 
tunable signal sources, test equipment, and eiectronrc coun- 
termeasure systems operating at microwave frequencies over a 
bandwidth of several octaves The interested reader desiring 
mofe details can consult the reference listed below. 

Reference 

NK Osbrmi". VIG-iuned OscdlaTDr FundarTienlals." Mjciawsve SYSfem^ Nb//s. 
NovBmbBr 19fl3, PD 107-I2i, 




Fig. 3. fa,l Under unilorm field conditions, Ihe magnetic mo- 
ments in a thin ferrimagnetic film precess at the same rate 
and have the same phase tbj When an RF field is applied 
to the microstnp transducer, a local perturbation of the mag- 
netic lield occurs, which ihen propagates through the fifm via 
the coupling ol adjacent magnetic moments. 
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Fig. 4. Sasic conliguration of a magneiostaoc-wave Oelay 

line 

causes a vibration to travel along the string. If the lension 
of the string is increased, it vibrates al a higher frequencv- 
!n a like manner, if the strength of the rtiagnetit: bias field 
is increased, the MSW delay line can be tuned for a higher 
frequency. 

Fig. 4 .shows a basic MSW delay line configuration. The 
input and output transducers, usually part of a microstrip 
circuit, are deposited on a dielectric substrate such as 
alumina or sapphire with a ground plane on the bottom 
surface and then brought close to the YIG/GGG structure. 
The thickness of the dielectric substrate delermines the 
distance between the VIG film and the ground plane, and 
therefore, determines the dispersion nharacleristics of the 
delay line. This dispersion relation is plotted in Fig. 5a for 
a magnetostatic-suif ace-wave delay line using a YIC film 
20 fj.m thick placed 254 ;:tm away from the ground plane 
and an applied bias field of lOOD oersteds. 

A more important parameter for delay lines is the group 
delay t^. which can be obtained by differentiating ^ with 
respect to oi (2-Trfl. Fig, 5h shows t^jBS a function of frequency 
for the delay line de.scribed above. Notice the relatively 
flat delay region from 4.85 GHz to 5.f)5 GHz followed by 
a dispersive delay region above 5,05 GHz. Because MSW 
devices can be luned by varying the bias field, these delay 
lines can be luned electronically io have either a constant 
or a dispersive delay as shown in Fig. 6. In Fig. 6a, the 
device, which uses a doped, lB.7-ftm-thick YIG film, is 
tuned Io a center frequency of 840 MHz using an applied 
bias field of 85 oersteds, resulting in a nearly constant delay 
of 150 ns over a 120-MHz bandwidth. Without changing 
the bias field, the same device can be used as a dispersive 
delay line at a center frequency of 94tl MH/ wilh a disper- 
sion of about 1.2 ns/MHz over a 120-MHz bandwidth as 
shown in Fig. 6b. By reducing the bias field to 70 oersteds, 
the dispersion can be recenlered at 840 MHz as shown in 
Fig. 6c, This process can be realized over the tuning range 



of the MSW delay line (for this specific device, the timing 
range is 640 MHz to 2.4 GHz). 

Various methods have been suggested to increase the 
bandwidth of linearly dispersive MSW delay lines. In one 
method, the linearization of the dispersion characteristic 
is achieved by varying Ihe ground plane spacing in a YIG- 
dielec trie-ground -plane structure,* as shown by Ihe cross 
section of the device in Ffg. 7. The frequency response of 
tills device is plotted in Fig. 8, which shows a linear disper- 
sion over a bandwidth of more than 500 MHz at a center 
frequency of 3 GHz. 

A nondispersive, electronically variable lime-delay unit' 
can be constructed by cascading a dispersive magnetostal- 
ic-surf ace- wave delay line with a batkward-volume-wave 
delay line as shown in Fig. 9a. When the individual delay 
lines are properly adjusted, a nondispersive response for 
the composite device can be obtained over a significant 
bandwidth (see Fig. 9b), Sy adjusting the bias field for 
either or both delay lines, the time delay for the composite 
device can also be controlled, as shown in Fig. 9c for an 
actual device operating al 3 GHz, 

MSW Filters 

Because ihe bandwidth of magnetostatic-wave delay 
lines can be controlled by the transducer design, these de- 
vices can be used in lunablc wideband and narrowband 
filters al microwave frequencies. In fact, there is very little 
competition from other technologies for MSW lunable wide- 
band filters. 

When multielement grating transducers are used to make 
narrowband low-loss filters, bandwidths as narrow as 15 
MHx have been obtained for the S (2 to 4 GHz), C (4 to 8 
GHz), X [8 lo 12 GHz], and K (18 to 27 GHz] frequency 
bands. Fig, in shows one such filler using seven-elemenl 
gratings on a 254-/i,m-lhick sapphire substrate. Fig. 11 
shows a mullitrace plot of the frequency response of this 
filler from 3 lo 7 GHz, The ripples are as low as 0,1 dB, 
poak-to-peak, and thenear-sideloberejeclion is greater than 
25 dB, 

MSW Resonators 

One-port and Iwo-porl tunable raagnefosta lie -wave reso- 
nators can be fabricaled using microstrip transducers for 
coupling and periodic etched-groove gratings as frequency- 
selective reflectors." 

At Hewletl-Packard, a special MSW resonator struclure 
was developed lo reduce the insertion loss and increase 
Ihe Q. This device, called a straightedge resonator (SER),^ 




Fig. 5. Dispersion (a) and group 
delay (tj versus frequency tor a 
magnelosla!ic-Burtace-wave de- 
lay line using a YIG film 20 nm 
thick placed 254 fun away from 
trie grouna plane. The applied rriag- 
netic biss held is 1000 oersteds 
and AttM = 1760 gauss. 
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Spin Waves and Magnetostatic 

Waves^ 

Fig. 1 a is a sctiemaiic reoreseniaiion oia ferrimagnelic material 
ol ihickness 1 A chain of magnetic dipoles is shown precessing 
in a uniform lenimagnettc resonance mode under the mlluence 
of an exiemal bras field H„ and an RF field, hnp. applied normai 
to Hq. This uniform precession mode is equivalent to a spin wave 
of zero order with an infinite wavelength. Increasing Ihe amplitude 
of tipiF will increase the angle of precession wiih respect to 
without changing the phase ol the precessing dipoles II the 
angle of precession of a single dipoie Is arbitianly increased (as 
a result of a localized increase in Iirf], ihe nearest neighboring 
dipoie will try to increase its precession also, by axiraciing energy 
from ihe internal exchange tield This process then continues to 
other neighbonng dipoles, resulting in a volume spin wave of 
wavelength 21/3 as shown in Fig. lb The process ol energy 
Iransilion from the RF field to the precessing dipoie system is 
followed by an energy transition to the crystal lattice This latter 
transition occurs either through the spin-spin relaxation channel 
or through the spin-lattice teiaxation channel Eventually Ihe 
energy is convened into heat 

When the interaction between the RFetectromagneticfieldand 
the magnetic dipoie spin system inside the YIG material is strong, 
the prevailing coupling mechanism among the spins is the mag- 
netic dipolar field and ihe exchange effects are negligible In 
this case, Ihe spin waves are called magnetostatic These waves 
are slow, magnetically dominated, and highly dispersive They 
travel with velocities in the 3-io-300-km/s range and exhibit 
wavelengths from 1 ;im to 1 mm. 

Reference 

1 G Winkler Masne'it Gflmefs, Fneor Vieuveg & Sotm, isai 
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Fig. 1. Under Ihe influence ol unilorm magnetic fields (a), 
Itie magnetic dipoles in a femmagnetic material precess at 
the same rate and have the same phase For this case, the 
wavelength = ^. It the RF magnetic field is not uniform (b). 
a volume spin wave is set up where the phase of neighboring 
dipoles differs slightly and this difference propagates wth 3 
wavelength = ?!/3, where t is the thickness of the material. 




Fig. 6. Group delay (lowertrace. 50 nsldivl and phase (upper 
trace. 90 degrees/divj characlenstics for a tunable magneto- 
static-wave delay line versus frequency (10 f/Hzidiv) (a) 
Center Irequenof = 840 MHz, Ho = 85 oersieds, relatively 
constant de^ay (b} Center frequency = 940 MHz, Ho = S5 
oersteds, dispersive delay, (cl Characteristics of (b) tuned 
for center frequency of (a) by decreasing to TOoersteds 
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Fig. 7. Crosssecbon otan MSW delay line using continuously 
varying YIG-filrn-to-ground-plane spacing fo linearize Ihe dis- 
persion cnaracieristic (see Fig 8) 

consists of a ferri magnetic resonant cavity made of a piece 
of YIG film on GCG cut into a rectangle by a wafer saw. 
The coupling transducers are gold microstrips patterned 
on a dielectric substrate, such as sapphire. The substrate 
is then mounted on an aluminum block, which forms the 
ground plane. This structure, shown in Fig, 12, is placed 
hftwRen the poles of an electromagnet so that the magnetic 
field is oriented parallel lo the Iransducers. which permits 
coupling to magnetostalic surface waves. These waves 
propagate along the lop surface of the YIG film and are 
reflected onto its other surface at the film's straight edges. 
A circulating wave pattern results that is resonant if the 
following condition is met; 

+ ^_|] = 2Trn 

where n is an integer, fi, and ^_ are the wave numbers 
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Fig. 6. Phase (upper trace) and group delay (lower trace) 
characlerisiics of MSW delay tine constructed with structure 
shown in Fig 7 
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Fig. 9. faj By cascading a tunable backward-votume-wave 
delay tins wnh a hxed surface-wave delay line, a nondssper- 
sive. electronicatly variable time-delay unit can be con- 
stiuded (Sethares. e( af). fbj The combination at the group 
delay characteristics ol the two MSW delay lines in (b> resutls 
in a nondispersive response that can be adjusted far different 
delays by varying Ihe strength ol the external bias field H^. 
(c) Delay characteristics for a device constructed as outlined 
in (a) 

for the top and bottom surfaces, respectively, and I is the 
distance between the straightedge reflectors, Very-high-Q 
resonators have been demonstrated and Fig. 13 .shows a 
typical transmission response [amplitude and phase] al 8,5 
GHz, In addition to their improved performance, these de- 
vices are simpler to construct than the etched -groove-re- 
flector resonators. 

Signal-to-Noise Enhancers 

A signal-to-noise enhiincer is a device (hat exhibits a 
high attenuation of small noise-Ievel signals and a lower 
attenuation of signals whose levels are above a certain 
threshold.'" This is the opposite of the characteristic of the 
more familiar power limiler. A lypicai application for such 
a device would be in a frequency memor>' loop where a 
microwave pulse is to be stored for a specified lime. 

A signal-to-noise enhancer using a YIG film on GGG is 
shown In Fig. 14. When a microwave signal is applied to 
the inpul port, a magnetostalic wave is launched in ihe 
YIG film, which results in an attenuation of the microwave 
signal. For small input signals, most of the energy is used 
lo launch the MSW, resulting in a large attenuation of these 
signals at Ihe outpul port. For larger inpul signals, only a 
fraction of their energy is used to launch the MSW wave, 
resulting in less attenuation al the output port. Fig, 15 
shows the measured attenuation versus frequency for dif- 
ferent inpul power levels. Over a frequency range of 2,7 to 
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Fig. 10. Photographs at a narrawtanti MSW delay-line filter using seven-elemsn! transducer 
gratings on a 254-n/n-ihick sapphire substrate with YIG film in place (left) and removed (rightj 



3.7 GHz, this device can give a 20-dB signijI-to-nDise en- 
hancemenl. A similar device using a plaie of lithium ferrite 
instead of Ihe YIG film gives comparable performance over 
ii frequency range of 5.5 to 7 GHz. 



MSW Delay-Line Oscillators 

Fig. 16 shows a schematic of a microwave oscil I a lor using 
a magnetostatic-smiace-wave delay line for the positive 
feedback path. The oiitpiil of fhfi delay line is fed to a 
solid-state microwave amplifier whose output is connected 
to a directional coupler. Oscillations occur at frequencies 



S2 1 log MAG 

REF -30.0 dB 
1 5.0 dB/ 

V -7.231 dB 




START 3.0000001300 GHz 
STOP 7 . 000000000 GHz 



Fig. 11. Frequency response ol 
the dei/ice showr> in Fig 10 at ten 
values ol the applied magnetic 
bias field. 
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Fig. 1Z Top (aj and side (bj views o! basic straightedge 
MSW resanalar structure.^ 



where the loop gain exceeds unity and the phase shift 
around the loop satisfies the rondilion; 
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Fig. 14, Basic siruciure o! an MSW sigrial-lo-noise enhancer 
(Adam. etar°}. 
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•^dfllay llnH + ^ampUfior + ^cDufiinr + 'Stables — ^TTC 

where n is an inleger. To achieve single-mode oscillations, 
die delay line must be a narrowband device. In the example 
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Ffg. 13. Typical amplitude (curve 1) and phase (curve 2) 
response for an B.5-GHz straightedge MSW resonator. 



disi;ussed here, this was achieved by using multielement 
gratings (or the input and output transducers and spacing 
the YIG film 400 away from the transducers." The 
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Fig. 16. Schematic ol a microwave oscillator using a mag- 
nelostatic-surface-wave delay line lor the frequency-control 
element Oscillation occurs for frequencies where the total 
phase shift around the circuit is a multiple ot 2tt radians. 
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Fig- 17. Amplitude and delay response of an MSW delay line 
using an 18-nm-thick YIG film, a 254-)in>-thick sapphire sub- 
strate, seven-elemen! microsirip transducers, and an external 
magnetic bias liBld of 2H0 oersteds. 



frequency response of a delay line using an 18-Mm-lhick. 
YIG film, seven-element transducers, a 254-/ini-thick sap- 
phire substrate, and an external bias Held of 2140 oersteds 
is sbown in Fig. 17. The 3-dB bandwidth is 15 MHz, the 
insertion loss is 6 dB, and the near-sideiobe rejection is 25 
dB. 

When this delay line was used in the configuration of 
Fig, IR, the oscillation frequency was tunable from 4 GHz 
to more than 24 GHz. A sel of speclrum analyzer measure- 
ments is shown in Fig. 18 for oscillations al 4.77, 10.14, 
16.43, and 23.78 GHz. Frequency jumping was observed 
al several points, which was attributed to the excessive 
delay associated wilh the cables and the multistage 
amplifier. Integrating this oscillator circuit should elimi- 
nate this problem. 

The oscillator's tuning characteristic, shown in Fig. 19, 
exhibits an essentially linear behavior. The phase noise of 
the oscillator was measured using an HP 3047A Phase 
Noise Measurement System and an HP 11729B Carrier 
Noise Test Set. An excellent singie-sideband noise of better 
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Fig. 18. Frequency response of circuit shown in Fig 16 using the MStV deley tine whose 
ctiaracteristics are s/iown in Fig 1 7, Tftys oscillator is tunable from 4 GHz to more than 24 GHz. 
as shown by Ihe spectrum analyzer plots lor oscillation at (a) 4 77 GHz. (bj 10 14 GHz. (c) 

76.43 GHe. and (dl 23.78 GHz 
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Fig. 19. Tuning cnaractenstic of oscillator described by Figs. 
J6, !7, and je. 

than -110 dBcwas measured for a 10-kH/ frequency offset 
when the oscillator was luned to 5.5 GHz |Fig, 20], This 
superior phase-noise characteristic and the simple IC-like 
fabricalion and assembly of magneto static- surface- wave 
delay lines make magnetoslafic-wave oscillators an attrac- 
tive alternative to YlG-sphere oscillators. 

HSW Frequency Multipliers 

Another appli(,aIion tor narrowband and wideband MSW 
delay-line filters is in frequency multiplication. Currently, 
microwave frequency multiplication is achieved using a 
step-recovery diode to generate the harmonics of an input 
signal. The diode is then followed by a YIG-sphere filter 
to choose the desired harmonic. One problem encountered 
in this configuration is the connection that must be made 
between the microstrip output from the step-recovery diode 
and the coupling loop around the YIG sphere. Another 
problem is the large shape factor of the YIG-sphere filter, 
which sometimes results in unacceptable harmonic sup- 
pression. If an MSW delay-line filter is used instead of the 



YIG-sphere filter, both problems can be solved. Fig. 21a 
shows the output of such a multiplier for an input signal 
of 1.682 GHz. The chosen output is the second harmonic 
and the suppression of the other harmonics is 50 dB. Fig. 
21b shows the oulput of the same multiplier for the same 
input signal, but with the MSW filter luned tor the third 
harmonic. The harmonic suppression is again 50 dB. 

Conclusions 

Magnetostatic*wave devices and systems offer special 
capabilities in the UHF and microwave frequency bands, 
and have potential in mi Hi meter- wave applications. Tuna- 
ble MSW delay lines provide advantages over coaxial ca- 
bles, especially al high microwave frequencies. Dispersive 
delay tines are attractive for pulse compression applica- 
tions, Wideband tunable filters can be used in microwave 
sweepers and amplifiers for harmonic suppression, while 
narrowband filters can be used as preselection elements. 
Signal-to-noise enhancers provide the capabilities of im- 
proving the signal-to-ooise performance of signal generators 
and the performance of frequency memorj' loops. 

Systems built using MSW devices should find wide ac- 
ceptance in instrumentation and communications applica- 
tions because of the compatibility of the technology with 
existing microwave integrated circuits. As described above, 
tunable oscillators covering the 500-MHz-to-25-GHz fre- 
quency range are feasible. In addition to their excellent 
phase noise performance, these systems are easier to build 
than YIG-sphere oscillators, which have stringent polishing 
and mounting requirements. Another example is in the 
area of frequency multiplication, where an MSW filter can 
be used to select the desired harmonic at the output of a 
step -recovery diode. 
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Disc Caching in the System Processing 
Units of the HP 3000 Family of Computers 

Disc caching uses the excess main memory and processor 
capacity of the high-end HP 3000s to eliminate a large 
portion of the disc access delays encountered in an 
uncached system. 

by John R. Busch and Alan J. Kondoff 



THE PRICE/PERFORMANCE RATIO of processors 
and llie cosf per bit of semiconductor memory have 
been falling rapidly in recent yesrs. while moving- 
head discs have been getting denser but not much faster. 
The resulting increase in the ratio of access times between 
secondary and primar>' storage, coupled with a reduction 
in the required number of secondary storage servers, has 
caused low processor utiliiialion in many systems. Propos- 
als to address ihis problem have focused on; 

■ Reducing the basic disc access time through ihe use of 
higher-speed actuators and rotation rates or through mul- 
liple actuators and heads. 

■ Reducing the effective disc access time by front-ending 
one or more discs with a semiconductor disc cache pro- 
viding multiple track buffering or by inserting an inter- 



Glossary 



Read hit rate. Ttie probability lhat the disc file information to be 
read will be lounO in a higher-speed memory and will not tiave 
lo be Obtained from a slower one 

Read miss. Failure lo find the desired data in a higher-speed 

memory 

Nowait. A lype ol input,'oulpul in which a process requests an 
t.'D operation and then proceeds with other processing wlille the 
l.'O operation completes 

Write wait probability. The probability ihai a process will have 
to wail for a disc write operaKon to complete after it has Initiated 
a nowail write 

File mapping. A type ot disc caching in which disc files are 
addressed as virtual memory locations, wilh Ihe processor 
hardware and memory manager performing physical location 
and file caching lunciions 

Disc domain. A contiguous section of disc storage containing 

file cJala 

File extent. A sequence oi lile lecords stored in a contiguous 

disc region 

Posting- Updating a record or tile in us permanent home in 

nonvolatile disc storage 

MIPS, lylillion instructions per second 

LRU. Least recently used. A crderion for removing cached disc 
domains from cache storage to make room for currently needed 
domains. 

Shadow paging. A transaction recovery mechanism in which 
file changes of uncommitted transactions are saved In temporary 
disc locations until transaction commit lime, whan the temporary 
changes are made permanent 



mediate level in the storage hierarchy between the discs 
and main memory using CCD (charge-coupled device), 
magnetic bubble, or semiconductor RAM technologv- 
■ Reducing the frequency of disc access through local and 
global file buffering in ihe main memory. 
The research and development effort reported here 
examined alternative approaches to exploit current trends 
in processor and memory technology' to realize significant 
improvements in system price/performance ralio, and 
applied the results to the HP 3000 family of computers. 
The solution we selected for the HP 3000 family was to 
apply the excess main memory and processor capacity of 
ihe newer systems to prefetch and cache disc regions and 
deliver data when requested at main memory speed rather 
than disc access speeds,' The approach eliminates not only 
a significant portion of the traffic between primary and 
secondary storage, hut also the bulk of process delays wait- 
ing for the traffic ibal remains. Through the integrated man- 
agement of data base, file, and transient object space, it 
matches data management requirements and the architec- 
ture of the HP [)Oni) family with the ciurent trends in pro- 
cessor ami memory technologies. 

In this report, we analyze the altomatives for exploiting 
processor and memory technology trends with respect lo 
cosi, performance, and reliability. We discuss disc caching 
design alternatives including fetch, replacement, and write 
Iiandling policies. We present an overview of Ihe tools 
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Fig. 2. Perfamance oi systems using convenlional memory hierarchies wittioui disc caching 
Jransaclion throughput is stiowt^ as a funcuon of eftective multiprogramming level (MPLj. the 
number ol processes !ha! can be concurrently executed. The top graptis show throughput 
versus CPU speed lor a fixed number ol discs. The bottom graphs show throughput versus 
number of discs lor a fixed CPU speed. 



developed for the performance analysis of the alternatives, 
and we present rneasurements of the system and subsystem 
behavior and performance of our implementation for the 
HP 3000 family. 
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Disc caching is standard on HP 3000 Series 42. 46. and 
68 systems and on HP 3000 High-Performance Series 39 
systems. It is supported by the MPE-V;P and MPE-V'P Delta 
1 operating systems, and by all releases of the MPE-V/E 

(conimueQ on page 24) 
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Disc Cache Performance Tools 



To evaiuate ihe pertoffriance of allemarive system archiiec- 
Uites art) managemeni Doitces, perfonriance neasuteineni and 
analysts tools were consirk^cied Insirumentaiion was defied, a 
slalislics upclaiing ard reooimg suDsysis"! *as Oes'gned and 
constructed, and display tools were specrfied and budl WoiKloaa 
generato's were collected, and moittioad cnaractetizers were 
consiructed A simuianon was Duiit 10 anaiyje seconaat^ storage 
caching, and a system model was consifucted fo' analysis ot 
Itie impact of disc caching across a Dread range of configurations 
ana workloads. Rg 1 shows the relationship ot the various per- 
(oimance measurement and analysis tools that were constfucied 
for this research 

The measuremerr! subsystem supports low-overhead, event- 
driven statistics gathering Statistics including resource queue 
length and service time histograms, service class compositions, 
transition distributions, and resource-specific event frequencies 
are supporiad by the processor mam memory, disc, and 
semaphore managers. A statistics display iooi allows the mea- 
surer !o specify the range of statistics to be measured and the 
desired duration of tneasuremeni 

To anaiyze the disc subsystem in delaii, a trace-driven disc 
workload characterize! and disc cache simulator were specified 
and buiii These tools provide msight into the worMoad's disc 
traffic and the impact of various forms of disc caching. These 
tools consume disc access trace fiies as input A trace record 
contains the time, accessor class, access transfer count, access 
lunction. and access location lor each disc access initiated by 
the system being traced 

The disc workload charactenzer produces a profile of disc I 'O 
workioad characteristics Thisincludesabreakdownof the work- 
load into Its access type components (data base, directory, se- 
quential access, etc.], interreference times, distributions of trans- 
fer sizes, active extent sizes, and extent LRU (least recently used) 



stack reference depths 

The disc cac-'ie simulaliofi aIio*s refined control ovw cache 
managernenl pofiCies (founding, extent adherence write han- 
dling, fetch sizes for each access type, and flush controll Any 
subset of tt^e accessors can be cached, so thai the localities of 
the access nslhods car be investigated m isolation The 
simulator decomposes ifie disc references into modes and ac- 
cess functions and gives cache hit information for eact- access 
type. In abdtEor to the cache performance infonnaiion for the 
access types, the disc cache simulator aiso gives cacne behavior 
information, including disttiijuiions of cache entry counts cache 
reference LRU siackdepths. and cache interreplacement times. 

The hii rates obtained from the simulations are used as the 
processor-to-disc and processor-to-cache transition prob- 
abilities in the system model Since the hit rates are obtained 
through simulation, main memory size and contention do not 
have to be captured explicitly by the system model This signif- 
icantly reduces the complexity of the system model. 

A custom analytic system model was required sc that a disc 
cache service station couid be explicitly included. This allows 
the user to specify Ihe system configuration and workload charac- 
teristics, and produces giobai performance estimates as weii as 
resource demand and utilization slatistics. The system model 
explicitly captures the effects of alternative secondary storage 
caching mechanisms, including external caching ot discs 
through an intermediate storage level and internal caching of 
discs through file mapping or explicit caching in primary memory. 

Tfie system model queries for workload and configuration infor- 
mation. These inputs are obtained from knowledge ot the msiai- 
iation. from the disc cache simuiation, and from the statistics 
collection and reduction tools inputs are specified for processor 
speed, disc configuration and speed, channel speed, processor 
and disc service demand, disc cache overhead, and disc cache 
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Fig, 1 . Performance measuiemem and analysis rods dei/e/oped to evaluate alternative system 
arclriectiires and management policies 
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read hii rale, fetch size, and posi handling method. 

Given the input sysiem configuration and workload characieri- 
zalion. ihe system model program constructs a closed queueing 
model The total workload is aggregated, and the network is 
decomposed into a simple central server model For internal 
caching, the disc cache sen/ice station is absorbed into Ihe 
processor station The resulting central server model is solved 
through a standard iteiauve procedure^ which iterates on the 
effective multiprogramming level 

Performance and behavioral characteristics are calculated for 
each server and for the system as a whole For each iteration 



on the mean mulliprogramming level, the uiilizalion. mean queue 
length, throughput, and response time ol each server are calcu- 
lated, as are the overall system response lime and throughput 
Serialization caused by contention for access control 
semaphores [e g . data base locl<S) can significantly limil the 
effective multiprogramming level The impact of senalizalion is 
not explicitly captured by Ihe model its effects can be approx- 
imated by reducing the effective multiprogramming level. 

Reference 

1 'j S lavsciberg, Cmrpuier Perto'msnce MotfBJJng Wwitfboa* Acaaemic P'BES. 
New rurk 1983 



(CQfNMnueiJ f'orti page 22] 

operating system. It includes software, additional memory, 
and in some cases, additional fimiM'are. 

Technology Trends 

Fig. 1 (page 21| shoves a conventional computer system 
storage hierarchy. Storage hierachies providu cost-effective 
system organizations for computer systems. Each succes- 
sive level of a storage hierarchy uses lovver-cost, but slower, 
memory components. By retaining frequently accessed 
code and data in the higher-speed memories, the system 
can operate at speeds clo.se to the access times of the fastest 
memories while most of the storage capacity is in lower- 
cost, lower-speed memories. The price and performance 
of a computer system are often dominated by Ihe orRani^Fi- 
lion and management of its storage hierarchy. 

Achievable system performance is a direct function of 
processor speed and utilization. Processor utilization, and 
so the effective exploitation of high-speed processors, is 
limited primarily by the WEiiting time caused by misses at 
various levels oi the storage hierarchy |i.e.. the desired data 
is not foimd in a higher-speed memory and must be ob- 
tained from a slower one]. Thus, for optimal system price/ 
performance ratio, the processor speed and the capacity, 
speed, and management of the levels of the storage hierar- 
chy must be matched. 

Traditional solutions for a low hit rate at a certain level 
of the storage hierarr:hy include improving the m^miigement 
policies of the levels, increasing the rapacity of the level 
incurring the low hit rate, speeding up the access time of 
the nest lower level of the hierarchy, and introducing a 
new level into the storage hierarchy. Cost and technology 
determine which alternative or combination of alternatives 
is appropriate. 
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Processor speeds are rapidly increasing, and costs are 
dropping. The order-of-magnitude advances in processor 
speeds in recent years have not been matched by propor- 
tional advances in disc access speeds. There is an access 
time gap of five orders of magnitude, and widening, be- 
tween a main memory referent:e and a disc reference in 
most current computer system storage hierarchies. By com- 
parison, the access lime gaj) between a processor cache 
reference and a processor cache miss resulting in a cache 
block fetch from main memory is normally less than an 
order of magnitude. 

Disc densities are rapidly increasing, and large- capacity 
discs offer a fourfold price advantage over small-napacity 
discs. This trend in disc technology exacerbates the hierar- 
chy imbalance. Since the disc subsystem often represents 
over 50% of the system cost, the trade-off towards a small 
number nf high-capacity discs is attractive. However, the 
demand for a disc grows with its capacity. Thus, realizing 
the cost advantages of a few large-capacity discs over sev- 
eral small-capacity discs reduces the potential for parallel 
service. This increases the mean disc gueue lengths, and 
thereby the expected values for disc service response time. 

Replacing discs with secondary storage devices employ- 
ing CCD, magnetic bubble, or .semiconductor RAM technol- 
ogy has shown limited cost-effectiveness. Bubble and CCD 
memories have not been able to keep pace with the density 
improvements and resulting drop in cost per byte of 
semiconductor RAM. while the cost per megabyte of 
semiconductor RAM is still two orders of magnitude greater 
than thai of discs. Leading-edge lech not ogles'^'' offer the 
potential for high-density discs with mean access times 
approaching 10 ms. Thus, magnetic-media secondary stor- 
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Fig. 3. External disc caching using a local buffer for each 
disc. 



Fig, 4. External caching al an Intermediate storage level. 



24 HEWLETT-PACKARD JOURNAt-reBlTUARY 1935 



) Copr. 1949-199B Hewlett-Packard Co. 



Wain Memory 



Olsc 



File Blocks 





Fig. 5. Many systems provide limited disc caching in main 
memory an a subsystem, file, or application basis. 



The MPE-IV Kernel 

The kernel of an operating system prouldes higher-level syslern 
software including applications, ciaia bases, and file systerus 
witti a virtual interface to Itie fiardware and primitive system re- 
sources The details ol system conligtiration. of hardware inter- 
facing, and of controlling access lo system resources are hidden 
dy the kernel 

The kernel multiplexes the system resources among the com- 
peting processes through the application of service policies. The 
overall performance of a generai-puroose computer system is 
lo a large extent determined tiy ihe eflectweness of these 
policies. 

As the HP 3000 began lo expand into a family ol computers, 
a new kernel was required lhal would support its evolution A 
research effort was undertaken, resulting m the MPE-IV kernel. 
The research approach and the design and performance of the 
resuliing kerr>el are described in reference 1 Our implementation 
ol disc caching for Ihe HP 3000 family naturally extends the 
algorithms, measurement tools, and facilities of this kernel 

The kernel structure is based on decomposing the system into 
resource managamem modules lhal can be easily replaced, 
while enforcing within a module strong separation of policy Irom 
mechanism so lhal new algorithms can be implemented In a 
leveraged manner Performance was sought through good al- 
gorithms, not in struct I on-level optimization 

Algorithms lor main memory management, processor manage- 
menl, disc access and space management, inierprocess com- 
munication, end semaphore management were developed that 
cooperale to optimize global system performance obfeclives 
rather than local performance objeciives. 

To achieve this cooperation, global priority assignments are 
made to processes based on eWernai policy specifications 
These process prionty assignments ere then 'ellected in the 
service requests for the various system resources so lhal local 
management decisions contribute to the global oljjeclives. The 
resource management algorithms ol the MPE-iV kernel are dis- 
cussed in detail in reference 2 
References 
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age devices and semiconductor primary memories can be 
expected lo remain dominant in computer storage hierar- 
chies in the years to come. 

The results of the direct application of evolving technol- 
og>- are showTi in Fig, 2 [page 22). These graphs use the 
analytic system model described in the box on page 23. 
The graphs show families of curves for increasing efteclive 
multiprogramming level (the number of processes in mem- 
ory demanding service that can be concurrently executed |, 
The lop graphs show transaction throughputasa function of 
processor speed for a fixed number (4) of 25-ms and 10-ms 
discs. The bottom graphs show transaction throughput as 
a funcHon of the number of 25-ms and lO-ms di.scs at a 
fixed processor speed of 3 .MIPS. 

A fixed channel bandwidth of 2M bytes/s is assumed for 
all the runs. Other assumptions were 10.000 instructions 
per disc visit, five reads per write access, IK-byte mean 
transfer size, and five disc accesses per transaction. These 
are somewhat characteristic of the referencing patterns of 
the HP 3000 family, although observed vsriations are wide. 
Changes in these parameters shift the curves, but the gen- 
era! characteristics are the same. 

We see that with conventional storage hierarchy manage- 
ment with this workload and with four 25-ms discs, effec- 
tive processor utilization extends oniy to 1 MIPS, and 
beyond 1 MIPS, performance is linear with re.spect tn the 
number of discs. With faster discs [four 10-ms discs), effec- 
tive processor utilization extends through 4 MIPS, with a 
sharp dependency on the number of 10-ms discs for the 
higher multiprogramming levels. High effective multipro- 
gramming levels art! required throughout to exploit proces- 
sor capacity. 

This indicates that for effective utilizatinn of higher- 
speed processors using conventional storage hierarchies 
and management techniques and high effective multipro- 
gramming levels, ever faster discs in greater numbers are 
required. As discussed above, the technology trends in sec- 
ondary storage devices do nnt support these requirements. 
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Alternative Balancing Approaches 

System thrpughput is diractly proportional lo effective 
inultiprngramming level, and inversely proportional to pro- 
cessor and disc response times and disc visit frequericies. 
Consequentlv, efforts to overcoTne llie limitations in exploit- 
ing the trends in processor and memory technologies have 
focused on: 

■ Reducing the effective secondary storage access lime 
through external disc caching techniques 

■ Reducing the number of secondary storage visits per 
transaction through internal disc caching in the primary 
memory 

■ Sustaining large effective multiprogramming levels 
through improved concurrency management schemes. 
In the next two sections, methods of reducing effective 

disc access time and visit frequencies are examined. Mod- 
ern methods for achieving high concurrency with data ac- 
cess in transactional systems are addressed in references 
4 through 8, 

External Caching Techniques 

Caching techniques have been applied to the disc subsys- 
tem to reduce the effective disc access time to secondary 
storage. Caching has been implemented in discs, in control- 
lers, and as stand-alone system components interfacing one 
or more secondary storage devices to the main memory. 

The use of a local cache per disc is depicted in Fig. 3. 
Sraith^ discusses buffering discs using bubble, CCD. or elec- 
tron beam memories. He concludes that three buffers, each 
a cylinder in size, would produce a hit ratio on the order 
of 96%, with LRU (least recently used) working well as a 
replacement policy, IBM has announced an intelligent 
cached disc featuring a 384K-byte microprocessor-driven 
controller that optimizes seeks and caches recently refer- 
enced data.'" Krastins" discusses a cache consisting of IM 
to 2M bytes of RAM thai is integrated with the disc control- 
ler. The cache buffers full physical tracks. He reports a hit 
rate of 85%, a mean access time of a to 12 ms, and hit 
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processing time less than 1 ms. 

The use of a cache front-ending the disc subsystem is 
depicted in Fig. 4. This can be viewed as inserting a new 
level into the storage hierarchy. 

The use and organization of a CCD memory level that 
could serve to close the gap between semiconductor and 
disc access times are discussed in references 12 and 13. 
and the use of magnetic bubble memory systems for this 
purpose is discussed in references 14 and ^5. But since 
bubble and CCD memories have nut been able to keep pace 
witli the density improvements and the drop in the cost 
per byte of semiconductor RAM, they have not qualified 
in gap-filling efforts. Rather, the technology of choice for 
external disc caches has been semiconductor random ac- 
cess memory. 

Hugelshofer and Schultz"' describe such a semiconduc- 
tor disc cache marketed by Computer Automation, Inc. It 
consists of 2M bytes of Rj\M placed between the processing 
system and up to four moving-head disc drives. They quote 
access times of 4 ms on a hit, with up to an 85% hii ratio. 
It is packaged with four RAI^ cards and an LSl-2 i/0 pro- 
cessor. 

The use of external caching, either locally or through an 
intermediate storage level, reduces the effective secondary 
storage access time on read hits, and potentially decreases 
the access time on writes, provided immediate physical 
update of the disc madia is not required on a write access 
before signaling transfer completion to the processor. 

Using Large Primary Memories 

With the improvements in memory densities and access 
times, very large main memories can he cost-effective pro- 
vided they can be exploited to reduce the traffic between 
main and secondary storage. Techniques lo exploit main 
memory for this purpose include auxiliary local buffering 
in apphcations and subsystems and global disc caching 
through explicit caching or file mapping. 

Systems have conventionally provided limited caching 
of the discs in main memory through explicit buffering on 
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a subsystem, file, or application basis as shown in Fig. 3. 

Centralized buffering schemes are employed in the 
Berkeley 4.2 UNIX file system for the Digital Equipment 
Corporation VAX computer family'" and in a product for 
the IBM Personal Computer.'^ As shown in Fig. 6. a fixed 
portion of main memory is set aside for global file buffers. 
When a file block read from disc is requested, the global 
buffers are checked tirsl. if the requested block is present 
in a global file buffer, the read is satisfied with a move 
from the buffer lu the user's address space. Otherwise, a 
global buffer is freed, a disc read of the block is initiated 
into ihe selected global buffer, and when the read com- 



pletes, the data is moved into the user's space. File block 
writes are performed through global buffers as well. 

In global disc caching, main memory partitions disappear 
and cached disc regions containing file data are centrally 
managed with the pieces of transient objects by the main 
memor\- manager. In this approach lo disc caching in main 
memory, pieces of the disc are mapped as data objects, and 
placed and replaced by the normal memon,- management 
algorithms like those used for code, stacks, etc. This ap- 
proach is shown in Fig. 7. 

Global disc caching in main memory can be either 
explicit beneath the disc access interface or implicit 
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through the use of file mapping. 

With explicit disc caching in main memory, tha access 
methods continue to address the discs, but a software trans- 
lation mechanism locates the required cached disc domain 
in main memory and moves the data between the cached 
region of the disc and the data area of the access method. 
This approach was implemented in the breadboard system 
for experimentation purposes, as described later in this 
report, 

With architectures supporting large virtual address 



spaces, pieces of tiles or discs or entire files or discs can 
be mapped directly into the address space. The location 
□f a piece of a file or disc in main memory is then handled 
hy the virtual-to-physical address translation hardware, 
and (he normal memory management mechanisms handle 
fetching and replacing of pages of files or discs. This ap- 
proach to secondary storage caching is depicted in Fig. 8. 
This approach was first employed in Mullics.'" and more 
recently as described in reference 20. 
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Requirements of Transaction Management 

Modem transactiona] data base systems guarantee that 
tbe data base is left in a consistent state in the event of a 
transaction abort or a system/disc failure. Solutions lo the 
problem of balancing the storage hierarchy must preserve 



the consistency and recovery principles of the transactional 
system. 

The standard mechanism used to achieve transaction 
recovery is the use of a write-ahead log.'' In ivrite-ahead 
logging, before the data in the data base is modified, copies 
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of the old and new images of Lhe data are written to a log 
file, along with, identification information. If the transac- 
iion aborts, lhe old images of dala modified by the transac- 
tion are read from the log and reslored into lhe data base, 
thereby "undoinp" the actions of lhe transaction. If the 
system crashes, a utility goes through the lag file and un- 
does any actions of uncommitted transactions. If a disc 
fails, the data base is restored to its last bacfced-up state 
and the actions of committed transactions are redone using 
the log file. 

For transaction recovery through write-ahead logging to 
work, the before and after images of the data along with 
identifying information must be posted to the disc before 
the data base is updated with the new values. Most data 
base svstems issue the log write, then wait for the physical 
post to complete before issuing the dala base write. 

Comparative Analysis of Alternatives 

h) the iollowing sections we evaluate the alternatives 
discussed above based on performance, cost, and reliabil- 
ity. We identify complementary approaches, and identify 
workload and configuration sensitivities affecting the suita- 
bility of alternatives. 

Evaluation witli Respect to Performance 

Evaluating with respect to performance, we saw in the 
preceding .section that effective processor capacity utiliza- 
tion drops rapidly with increasing processor speed if con- 
ventional storage management is employed. Faster discs 
and more discs provide benefit in such systems, but the 
benefits are not commensurate with those obtained through 
secondary storage caching. This is demonstrated in the 
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Fig. 13. Potential benefit of disc caching for (fie HP 3000 
tamily Is shown by this graph of read hit rates that can be 
achieved by various access mettiods as a function of cache 
capacity. 
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Fig. 12. Worl^load characteristics of (/Je HP 300D family of 
computers wittiout disc caching. 
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modeling results shown on page 27. The graphs show trans- 
action throughput as a function of processor speed with 
conventional storage managemenl |Fig. Sa), external disc 
caching (Fig. 9b|. and caching of discs in the primary mem- 
ory via explicit global disc caching (Fig. 9cj and file map- 
ping {Fig. 9d|. Assumplioris are the previously specified 
workload characteristits. four 25-ms discs, read hit rates 
o( 85% on disc file accesses, write wait probebilities of 0. 
iK-bylemean transfers mlhout caching, and 4K-byte trans- 
fers with LKU replacement with caching. All of the second- 
ary storage caching ahernatives are able to provide effective 
processor utilization al processor speeds several times 
those that can be effectively used without caching with 
equivalent processor disc configurations. Further, disc 
caching in the primary memory, either explicit or through 
file mapping, is able lo provide effective processor utiliza- 
tion at speeds twice those that can be effectively used by 
external caching, assuming the same overheads and hit and 
wait rales. This is a result of the access time differential 
between main and external cache storage and the higher 
degree of paraUelisra in cache access with primary storage 
caching. 

The overhead of interna! caching becomes negligible as 
processor speeds increase, whereas the overhead in exter- 
nal caches stays fixed. However, with slower processors 
and a balanced cnnfigurafion leading to high processor 
utilization without caching secondary' storage, explicit in- 
ternal caching degrades performance relative to no caching. 
The added overhead of locating the disc region in memory 
and moving the data to the target area does net pay off 
when the processor utilization is high. This effect was ob- 
served in the HP 3000 internal caching measurements. The 
low-end family members degraded in performance when 
explicit internal disc caching was enabled. External cach- 
ing outperforms explicit internal caching in high utilization 
ranges as well, since the data transfer is performed in paral- 
lel with processor use, so the move overhead does not 
consume valuable processor time. 
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Fig. 15. Disc read caching In the breadboard MPE kernel- 

The dependency on disc access lime and ihe number of 
discs is dramatically reduced with the secondary storage 
caching alternatives. This is shown for file mapping in Fig. 
10 for the workload, disc subsystem, and read hit and write 
wait probabilities used in the previous modeling results. 
The behaviors of external caching and explicit internal 
caching are similar. By contrast wilh the linear dependency 
on ihenumber of discs seen in Fig. 2 fora 3-MIPS processor 
without secondary storage caching, full processor utiliza- 
tion can be achieved with only a few discs at 7 MIPS with 
secondary storage caching. Keducing disc access lime from 
25 ms to 10 ms increases the effective utilization range 
from 1 MIPS to 4 MIPS without caching (Fig. 2b), With cach- 
ing, reducing the mean disc service time extends Ihe effec- 
tive utilization beyond 30 MIPS with file mapping [Fig. 1 Oh), 
but processors through 16 MIPS can be effectively used with 
a few 25-ms discs (Fig, 10a|, provided that adequate read 
hit rales and write wait probabilities can be achieved. 

The performance impact of read hit rates and write wait 
probabilities with systems employing secondary storage 
caching is .significant. Fig, n shows the sensitivity of file 
mapping to the read hit and write wail probabilities. These 
are shown for two processor speed.s, :i MIPS and 7 MIPS. 
The sensitivity to read hit and write wail probabilities in- 
creases wilh processor speed, and there is a rapid drop in 
effective processor utilization for higher-speed processors 
as the road hit probability dRcreases and the write waii prob- 
ability increases. This sensitivity to read hit rate and write 
wait probability decreases with reductions in disc access 
time. Thus faster drives enhance caching alternatives, and 
indeed are required if sufficiently high read hit rates and 
sufficiently low write wait probabilities cannot be 
achieved. The characteristit; dependencies of the other 
caching alternatives on read hit rates and write wait prob- 
abilities are proportional to those presented here for file 
mapping. 
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Special mechanisms can be employed to achieve high 
read hit rales and low write wait rates. Since Ihe seuondary 
storage caching allernalives are sensitive to these parame- 
ters, it is important to exploit them. 

Integrating cache fetch, replacement, and wrile handling 
with transaction managomenl requirements is achievable 
with internal caching as described below. With external 
caching, on the other hand, this would be difficult to 
achieve, and high overhead would be incurred because of 
the protocols required between the host and the external 
device. Consequently, cache memory ulilization and the 
sustained effective multiprogramming level of internal 
caching can be superior to that achievable with external 
caching. 

With internal caching, the size of a fetched disc domain 
on a read miss can be tailored to the structure of the data, 
based on knowledge of the storage layout (e.g., fetch extents 
instead of fetching tracks containing unrelated data or only 
pieces of the required extent). The replacement policy can 
exploit operating system knowledge of access patterns (e.g.. 
the policy can flush a cached disc domain from the cache 
memory after sequential reference and on file purging). 
Write posting order and write and read priorities can be 
adjusted to meet the current needs of transaction manage- 
ment. 

Evaluation With Respect To Other Factors 

Develupmenl ul multiple local buffer mEmagement 
schemes is redundant, and the buffering capacity is 
localized and unresponsive to current memory loading con- 
ditions. The amount of memory devoted to the buffering 
of a specific file or subsystem would likely be either exces- 
sive or insufficient at any given moment, depending on 
the current memory availability and the current workload 
demand and priority structure. The problems are analogous 
to those encountered with older memory replacement 
policies based on fixed partitioning. 

Global file buffers require a fixed partition of main mem- 
ory' between swap space and buffer space, and therefore 
arB not responsive lo dynamic memory load conditions. 
They suffer from fixed-partition problems as do local file 
buffers. Furthermore, they provide memory management 
functions, including space allocation, replacement, and 



disc access initiation and interrupt fielding. Therefore, de- 
veloping efficient algorithms for the global buffer manager 
is redundant to the development of the manager of memory 
for transient code and data. Moreover, as is discussed in 
the next section, global file buffers are not amenable to 
supporting efficient local caching in decentralized systems. 

Explicit internal disc caching and file mapping overcome 
the fixed-partition problems, so they can be expected to 
achieve higher read hit rates with dynamic workloads. Fur- 
thermore, it is easier to integrate disc access posting order 
and priority adjustments with these schemes. 

Explicit internal disc caching in main memorv" requires 
a translation mechanism to locate a required cached disc 
domain in main memory. In large main memories there 
can be several thousand cached disc domains present at 
any moment, and locating a cached domain through a trans- 
lation of disc address to main memory address can be ex- 
pensive in processor cycles. Additionally, if the processor 
has a cache, Ihe location sequence can flush a large portion 
of the cache while chasing through list structures. However, 
explicit internal disc caching can be performed with any 
processor architecture. 

File mapping appears to offer the best of the alternatives 
for caching of discs in main memory. It leverdges the main 
memon,' management mechanisms, has no limits on main 
memory space applied to file caching, and has no overhead 
for location of file domains in main memory. Applicabibty 
of this approach is limited, however, to processor architec- 
tures with large virtual address spaces \^2'*^ bytes to map 
many large files) and amenable protection mechanisms to 
restrict access to mapped-in files. Stonebraker^^ discusses 
problems with performing transactional management on 
mapped hies with 32 bits of address space. His studies 
were based on the VAX family. He found that the overhead 
of mapping in and out piece.s of tiles lu Hccominodate the 
protection and space limitations exceeded the overhead of 
explicit data base buffering. 

Evaluating with respect to cost, using faster discs and 
more of them lo overcome the performance limitations 
without caching is clearly not cost-effective. External cach- 
ing is more expensive than internal caching since addi- 
tional power, cooling, cabinetry, and electronics are re- 
quired in addition to the extra memory required for the 
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caching. Also, more memory would be required to achieve 
the same hit rates in external caching unless the cache 
management is integrated with the operating system 
policies. 

Evaluating with respect lo retiabilit\', internal caching 
introduces no new hardware components into the system. 
The reliability is identical lo the uncached system, whereas 
any of the external cache architectures necessarily degrades 
system reliability by introducing additional hardware com- 
ponents. The software.'firmware complexity of explicit in- 
ternal and external caching is roughly the same, so reliabil- 
ity degradation because of cache management is compara- 
ble for these alternatives, while file mapping is simpler 
and more reliable. With external caching, the posting strat- 
egy of the peripheral cache is not integrated with the system 
posting strategy, so a consistent level of integrity is not 
guaranteed for transactional data base systems unless post 
order and wait for media update are observed, thereby im- 
pacting the write handling performance of peripheral cache 
alternatives. 

Applicability to the HP 3000 Family 

The research and deveiapmeiil leading to the MPE-IV 
kerneF^ examined alternative algorithms for processor, 
main memory, disc, and semaphore management. This re- 
search focused on the interactions between algorithms 
managing these basic system resources, and determined an 
algorithm set for these resources that provides good perfor- 
mance through algorithm integration. The MPE-IV kernel 
is briefly described in the box on page 25, Above the man- 
agement of fliese basic system resources, subsystems and 
applications manage dala to provide extended system func- 
tionality. The data is kept in structured pemisnenl and 
temporary objects including files, data bases, slacks, and 
heaps. Concurrent data access is managed through locking 
or versioning. Recovery from Iransaction aborts or system 
failures is handled through checkpointing, write-ahead log- 
ging, and roll-forward/roll-back recovery. To exploit the 
system price/perforniance potential offered by evolving 
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lecimology. basic kernel main memory, disc, and processor 
management needs to be extended and integrated with sub- 
system and application data and recovery management. 
This was well demonstrated in our follow-on research to 
MPE-IV. 

After the management strategies for main memory, disc, 
processor, and semaphore resources were integrated in 
MPE-IV to produce balanced resource utilization and sig- 
nificant performance improvement across the HP 3000 fam- 
ily, a new high-end system, the Series 64, was introduced. 
Its processor is twice as fast and its memory is four times 
as large as the previous top of the line. The system perfor- 
mance improvements reaOzed with this computer were, 
however, sublinear with processor speed relative to the 
previous top-of-the line system. System performance with 
this high-end computer was found to be very sensitive to 
disc subsystem throughput and access limes, but relatively 
insensitive lo main memory capacity, in spite of the inte- 
grated strategies of the breadboard kernel, system perfor- 
mance was not st;alingas the family was extended to signif- 
icantly more powerful systems. 

We investigated this scalability problem, and found it lo 
bo caused by a lack of scalability in the algorithms used 
for subsystem and application level buffering and recovery 
management. Processor uUli^allon was limited by subsys- 
tems making processes wait for disi: accesses to complete. 
The disc wrile accesses were being generated by local buffer 
replacement and wrile-ahead log management. The disc 
read accesses were irutiated to resolve local buffer read 
misses. 

We wished to extend the basic algorithm integration for 
processor, main memory, semaphore, and disc manage- 
ment lo exploit current technology trends to realize signif- 
icant improvements in system price/ performance ratio. We 
examined disc buffering, intermediate storage levels be- 
tween primary and secondary storage, disc caching in ex- 
cess primary memor>' of the system processing unit, and 
inlegraling kernel resource management with higher-level 
data management. 

Potential For Secondary Storage Caching 

Our analysis, presented in the prei^ediiig sections, indi- 
cales that explicit internal caching is the best alternative 
for the HP 3000 fatnily. given its architecture [object sizes 
must be «64K bytesj. We proceeded to investigate whether 
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\he workload c ha racf eristics were amenable. 

Fig. 12 indicales the salient workload characteristics of 
the secondary storage referencing characteristics of the HP 
3000 family. These [:haracteristii:s were obtained using the 
measurement tools described in the box on page 23, Fig. 
12a shows the transfer sizes for the major access melhod.s 
(sequential, directory, data base). The distributions indi- 
cate that relatively small transfers are performed between 
primary and secondary storage by all the access methods, 
with 90% of the transfers less than IK bytes. Fig. 1 2b shows 
the spatial locality of access method references to second- 
ary storage. This stack depth distribution presents the prob- 
ability of a reference at a given depth into a stack of disc 
references ordered on a least recently used basis. The dis- 
Iribiilion indicates that over 70% of all disc references are 
within the last 30 disc regions referenced. Fig. 12c shows 
the temporal locality of references of the access methods. 
It indicates that 50% of the disc references are to regions 
that have been referenced within the last second. 

The resulting potential of secondary storage caching for 
the HP 3000 family is indicated in Fig. 13, This figure 
show.s the read hit rales that can be achieved by the various 
access methods as a function of cache capacity. The data 
far this graph was obtained using the disc cache simulation 
model on disc access traces obtained from an internal HP 
production facility. Forlhis installatioa, mean read hit rates 
□f 85% are achievahle with a few megabytes of memory 
applied to caching disc regions. Results using other trace 
data and field measurements of our system in use show 
read hit rates typically in the 65-85% range and read-to- 
write ratios ranging from 3:1 to 5;1. 

The Explicit Internal Disc Cache Implementation 

The MPE-iV kernel baiie was used to inVHStigale princi- 
ples and integrated approaches to caching discs in the main 
memory. Algorithm interactions were observed and im- 
provements developed. Differences between disc and main 
memory caching and areas in which architectural improve- 
ments would be of benefit were noted. 

The resource management mechanisms and strategies in 
the MPE-fV kernel provided an efficient, extensible re- 
search base. These strategies were extended to support 
explicit disc caching in primary memory. The resulting 
mechanisms and strategies integrate kernel and data man- 



agement. Knowledge of file structure and access type is 
exploited to enhance prefetch and replacement decisions 
fur disc domains. Data recovery protocols are supported 
without wait for posting through kernel adherence to post 
order constraints. Priorities of disc accesses are adjusted 
to reflect the changing urgeni:y of read and write requests 
because of data management locks or commits. Process 
priorities are adjusted to reflect lock states. 

The overall structure is shown in Fig. 14. The user pro- 
gram requests records. The file system maintains local file 
buffers, and on buffer misses or replacements, accesses the 
I/O system to initiate buffer reads or writes. Beneath the 
I/O interface, pieces of the disc memory are cached in main 
memory. Actual disc transfers are initiated by the cache 
manager. 

Read Handling 

Fig. IS shows read processing with and without caching 
enabled. Without caching, the disc transfers data directly 
to the buffer, but a disc access delay is incurred. With 
caching, the disc transfers data to an area of memory re- 
served for the disc domain, and then the data is moved to 
the buffer using the processor. On read hits, data access 
requires locating the data in memory and moving it, rather 
than incurring a disc delay. This is performed on the cur- 
rent process's stack without a process switch. 

Locating Cached Disc Regions 

The tiached disc (lorniim Inuaiion mechanism employed 
in the breadboard kernel is depicted in Fig. 16. A separate 
list is maintained for each disc that identifies the memory 
regions corresponding to currently cached domains from 
the disc. The list is ordered by increasing disc address, and 
a microcoded link-list-search ILLSH) instruction is used to 
locate a required region in the list. 

This location scheme requires about 500 instructions for 
setup and cleanup, plus two memory references and two 
compares in the LLSH instruction for each cached domain 
in the list. Thus the overhead of translation increases with 
the memory size. This is not a particularly good feature. 
Thousands of domains can be cached for each disc in a 
large memory, so the overhead of translation can became 
significant. In hindsight, more attention should have been 
paid to the location mechanism. The overhead would be 
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rednced significantly by employing a hashing rather than 
a linked-iist algorithm, but this is compUcated by the vari- 
able size of cached disc regions, .^chitectures supporting 
file mapping in virtual space eliminate this overhead al- 
together. 

Write Handling Mechanisms and Policies 

Write handling with and without internal disc caching 
is shown iu Fig. 17. Without caching, the transfer occurs 
directU' to the disc from the buffer. The transfer can be 
initiated with or without wait, with explicit completion 
synchronization occmring iater. 

When a write is initiated wiih internal disc caching, 
either a disc domain already containing the specified disc 
addresses or a free area of memory is allocated and the 
dafa is moved into that area. A write to the disc is issued, 
and the process can continue to execute. The write is con- 
sidered complete once the memory move occurs unless the 
file has been specified as requiring physical post before 
completion notification. Note that, thanks to the support 
of variable-sized segments in the HP 3000 architecture, a 
write does not require a fetch of the disc region. In paged 
systems, a write of a partial page requires thai the page be 
fetched before the write can be initiated, thereby incurring 
an additional disc access on a write miss. 

If there is currently a write pendini; againsi Ibe specified 
disc domain, the process's request is queued until the pend- 
ing write is posted to disc. The process may continue 
executing until it requests a wait for its write completion 
and the request is still queued. If the disc domain to be 
written is not currently cached, an available region of mem- 
ory is obtained to map the corresponding disc image — i.e.. 
no fetch of the disc domain to be written is required. When 
the move effecting the write takes place from the process's 
data area to the cached image of the disc, a post to the disc 
is initiated. Only the portion of the cached disc image thai 
is modified by the write is posted. After the move lo the 
disc image is pBrtormed and the post to dist; is initiated, 
the writing proce.ss is allowed to conlinue running withoul 
having to wait for ibe phy.sical posl update to complete. 
This is all handled on the curreiil process's .stack, without 
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even a process switch. 

A write-through policy was chosen for our Implementa- 
tion of infernal disc caching. The post request to disc is 
issued at a background disc priority. The priority of a pend- 
ing posl request is raised if the process wails for the post 
to complete or the region is required by the main memory* 
replacement algorithm. Thus, issuinga physical post when 
Ihe write is performed rather than waiting for replacement 
has no negative impact. Only idle disc capacity is used. 
Furthermore, pending writes of adjacent disc regions can 
be gathered into a single phj'^sical transfer, thereby reducing 
the total number of required physical accesses. 

Write-through also has another benefit. The transaction 
recover\' mechanisms require synchronization on physical 
commit at some point, so performing these early saves com- 
mit delays. 

The write protocol is shown in Fig, 18. 

We saw in the preceding sections the significant impact 
of having to wait for writes. To minimize the probability 
of waiting for disc writes lo complete, we developed a 
special mechanism that allows the specification of posting 
order constraints at post time or on a file basis so that 
posting can proceed without wait if only a posting order 
constraint exists. With this facility, the post of the log can 
be issued on a nowait basis, and the data base write can 
he executed immediately. The kernel guarantees that post- 
ing order within a serial write queue matches the 
chronological order of posl initiation. The sequence for 
write-ahead log use of this facility is depicted m Fig. 19. 

To ensure disc consistency, only one write access for a 
serial write queue can be pending at a time. This is depicted 
in Fig. 20. Since this limits parallel use of available disu 
drives, multiple serial write queues for unrelated postings 
are beneficial. 

Cache Fetch, Placement, and Replacement 

Thu kernul's main memory placement and replacement 
mechanisms were extended to handle cached disc domains 
in the same manner as segments. Thus, cached disc do- 
mains can be of variable size, fetched in parallel with other 
segments or cached disc domains, garbage collected, and 




Fig, 20, To ensure disc consis- 
tency, only one mite access for a 
serial wile queue can be pending 
at any time. Multiple serial write 
queues can be used to allow 
parallel use ot available disc 
drives lor unrelated postings. 
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Fig. 21. Dynamic partitioning of cache operstions across tfie 
system discs ot an HP 3000 system 



replaced in an inlegrateci manner with stacks, data seg- 
ments, and code segments. The relative allocation of main 
memory between slack, data, code, and cached disc dnmain 
objects is entirely dynamic, responding to Ihecurrenl work- 
ioisd requirements and current memory availability. 

Fetching and replacing differ from processor caching of 
main memories, Rao^* discusses the impact of replacement 
algcrilhms on cache performance for processor caching of 
main memory, finding that the replacement algorilhm has 
a secondary effect on cache performance. 

With internal disc caching, when a request is made to 
read data that is not currently cached, the fetch strategy 
uses knowledge of file blocking, extent structure, access 
method, and current memory loading to select the optimal 
size of disc domain to be fetched into memory. The fetch 
is performed in an unblocked manner so that the requesting 
process or another process can run in paraUel with the 
cache fetch from disc. With processor caches, the processor 
must idle on a cache miss since the process switch time 
exceeds the cache miss processing lime. 

No special treatment of cached domains is required for 
the replacement algorithm. II naturally protects the tran- 
sient objects in smaller memories because of their smaller 
interreference times, and uses largo memories far extensive 
write and read caching with the relative amount of cached 
disc space per device skewing naturally lo the heavily re- 
ferenced devices. This dynamic partitioning feature is very 
significant, since device skewing is common both in terms 
of referencing frequency and tiie spatial locality of refer- 
ences. 

Fig, 21 shows the cache hit ratios, the read-to-wrile ratio, 
and the partitioning of main memory allocated for cached 
domain.s across the discs of an HP 3000 Computer System. 
The normal LRU-type replacement algorithm does fine in 
responding to the variable demand requirements of the 
various disc volumes. 

Explicit prefetchinj^ and flushing for sequential access 
was found lo improve hit rates through simulation studies 
on standard trace files, whereas special prefetching and 
flushing for other access modes did not. 

This policy was implemented in the kernel. When a pro- 
cess finishes referencing a cached disc domain in sequen- 
tial mode, the domain is flushed immediately from main 
memory since it won't be needed again. In this way, mem- 
ory utilization is improved over that achievable with the 
kernel's standard LRU-type replacement algorithm. 

External Caching Controls 

The external controls tar caching allow caching to be 
enabled/disabled against specific discs, display the current 
status of disc caching, set posting policies on a system and 
file basis, and control the roundoff fetch sizes for random 
and sequential access. 

Defaults fnr the tuning parameters were selected based 
on simulations of disc access traces using the simulation 
model. Good defaults for random fetch sizes were found 
to be 4K byles and for sequential fetch sizes, 24K bytes. 
Large prefetches were found to pay off significantly for 
sequential, hut not for random type accesses. Rounding the 
fetching above the requested block was found to be superior 
for all access methods to fetching below the requested block 
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or centering on the requested block. The choice of tuning 
parameters is. as always, an adiustmenl to ihe access pat- 
terns of the particular snbs>'stems and data bases. 

Performance of tfie Disc Cache Implementation 

With ihese mechanisms and strategies. Ihe extended ker- 
ne! significantly reduces the traffic between the main mem- 
ory and secondary disc storage and significantly reduces 
delays in reading or wTitingdiscinforraalion. Read hit rates 
up to 85% are common for file, data base, and directory 



buffer fills. These read hits eliminate up to 65% of the disc 
accesses with a 5:1 read-to- write ratio. Because of the cach- 
ing of writes, most delays for posting are eliminated. To- 
gether, the read hits and cached ivrites eliminate 90% of 
process delays caused by disc accessing. This dramatically 
reduces semaphore holding limes, which especially bene- 
fits data base systems. 

Tlie impact on system performance over the noncached 
kernel is shown in Fig. 22. Tiiroughput improvements of 
50% and response time reductions of 5:1 are standard on 
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Fig. 22. Pertarmance of HP 3000 Series 48 and 68 systems with and without disc caching. 
Throughput improves most on Ihe larger Series 68 system with caching 
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the high end ISeries 68) while the midrange syslem (Series 
48) gets about 25%. For the low-end system (Series HI or 
33. not shown in Fig. 22), performance actually degrades, 
thus demonstrating the scaling of performance with proces- 
sor speeds. However, the midriinge syslem wilh kernel disc 
caching outperforms the high-end machine without kernel 
disc caching. 

Fig. 23 compares the kernel wilh and without raching 
for configuration.s with four 1 25M-hyte discs or I wo 4UIIM- 
byte discs. The cached syslem with two dist: servers outper- 
forms the uncached system wilh four disc servers. More- 
over, tiie cached system performance is the same wilh two 
or four discs, thereby demonstrating that caching can ex- 
ploit the system cost advantages of large-capacity discs. 

A henchmark comparisoii^^ was made between an IBM 
3033 [5 MIPS] and an HP 3000 Series 68 (1 MIPS] wilh 
and without applying disc caching in main storage. The 
benchmark ran Compserv's Materials Request and Planning 
software developed for both the HP 3000 and the IBM 3033 
systems. The benchmark consists of a materials request 
and planning batch job that performs a large data base 
implosion to determine the parts, lead times, and orders 
to manufacture a specified set of products subject to a mas- 
ter production schedule. This type of application tends to 
be highly I/O intensive since it accesses the data base in 
an inverted manner, starting with schedules and searching 
tor qualifying parts. 

The benchmark ran in 28 hours on the IBM 3033. 49 
hours on the HP 3000 Series 68 without kernel disc caching, 
and in 27.4 hours on the HP 3000 .Series fi8 wilh kernel 
disc caching. That the HP 3000 Series lifl could outperform 
the IBM 3033 in spite of the fivefold difference in processor 
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Fig. 23, Eltecis ot mulliple discs and disc caching in the HP 
3000 Series 68 Sensitivny !a the number of discs is dramat- 
ically reduced by caching. 



speed indicates that caching by the kerne! of disc domains 
in excess main storage wilh the cache management policies 
introduced in this research has potential application in 
other computer families employing locally managed, lim- 
ited caching u\ data items by data base or file systems. Data 
base and file system caching is limited to a fixed capacity 
and applies policies that optimize for the standard access 
approach. When more resources are available, such as main 
memory for a stand-alone batch job, and access is nonstan- 
dard, as in this inverted-access case, localized caching 
policies do not respond. 

In small memory systems under memory pressure and 
with slow processors, disc caching degrades performance. 
The cost of locating the cached disc domain in main mem- 
ory and moving the domain gets added to the disc access 
time. With low hit rates and slow processors, this overhead 
exceeds the benefits of caching the discs. This overhead is 
not present in architectures supporting file mapping. 

Conclusions 

This research and development effort examined alterna- 
tive approaches to exploit current trends in processor and 
memory technology to realize significant improvements in 
system price/performance ratio. The results were applied 
to the HP 3000 family. 

Caching secondary storage in primary main memory 
through explicit internal caching or file mapping, coupled 
with integrated kernel and data management algorithms, 
was found to provide the most cost-effective and best per- 
forming alternative, given current technology trends. These 
approaches are supplemented by improvements in second- 
ary storage device characteristics. 

Secondary storage cache management can be integrated 
with data management to obtain significant incremental 
performance improvements. Specific improvements were 
identified by integrating write-ahead logging wilh disc ac- 
cess scheduling through the adherence to posting order 
constraints, humping the priority of affected disc post re- 
quests at commit time, and using extent boundary and 
access type knowledge for selecting the fetch size on read 
misses and flushing on sequential access. 

Effective utilizatinn ot high-speed processors requires 
sustaining a high multiprogramming level. Improved con- 
currency control schemes, such as late binding, granular 
locking for updates, and versioning for queries, help 
acfueve this. Data base locking can be integrated with ker- 
ne! resource management by raising the prioritv of a lock 
holder when a more urgent process queues on its lock. 

Providing caching of discs in main memory differs from 
caching main memory tor processors in several ways. The 
delay in resolving a read miss is highly state-dependent, 
requiring disc qneueing and head positioning components. 
Since the read miss resolution time is long compared with 
the time required to state save the current process and 
launch another process, other ready processes can be run 
if the effective multiprogramming level is sufficiently high, 
allowing productive processor utilization while resolving 
read misses. Special knowledge of access patterns can read- 
ily be exploited to improve cache fetch and replacement 
strategies. Write misses can be resolved without requiring 
a fetch from the backup store if variable-sized allocation 
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is used, thereby using large caches to eliminate process 
waits on write misses. Special constraints exisi on theiiTite 
handling poOcies for updating the backup memorj- because 
of the requirements of wriie-ahead logging and shadow 
paging In transactional syslems. Classical techniques used 
to model processor caches did well in analyzing hit rates, 
but more complex models were required lo capture queue- 
ingand nowait effects found in secondar\' store caching. 
The implementation of explicit internal disc caching for 
the HP 3000 family graphically demonstrates the resuhs 
the analysis predicts. The effective utilization of higher- 
speed processors and reduced dependency' on the number 
of discs is clearly demonstrated. The implemenlation dem- 
onstrates further that integrated internal caching of second- 
ary storage can be accomplished with conventional 
hardware and software architectures. Disc dafa manage- 
ment integration with internal cache management can be 
achieved in existing systems by providing special functions 
to allow subsystems and applicalions to influence the man- 
agement of caching, disc scheduling, and main memory 
management to meet their performance and renovery objec- 
tives. 

We feel that the major contributions of this research and 
development effort are the performance analysis tech- 
niques, the characterization of secondary storage caching 
alternatives, and the mechanisms we developed to integrate 
Ihe requiremeiils of high-level data managemeni with the 
basic kernel management of processor, disc, primary mem- 
ory, and semaphore resources. Our leveraging of kernel 
mechanisms for transient code and data management re- 
sulted in reduced development lime, highly parallel cache 
management, and a smooth, dynamic parlilioii of main 
memory. Our extensive testing (over a year] in varied inter- 
nal production sites resulted in a high-quality product. 
However, our implementation of the location mechanism 
is inefficient tor large memories, and further work could 
be done on dalii bases and applications to exploit more 
fully the underlying cacliing. 

The performance advantages of secondary storage cach- 
ing will be needed in decentralized as welt as centralized 
transaction systems. Caching secondary storage in decen- 
tralized systems introduces problems of cache consistency 
analogous to those encountered in multiprocessors with 
private caches. Analyzing and building solutions to realize 
the performance benefits of caching secondary storage in 
decentralized configurations presents us with a challenging 
f(ili'iw-ni) lo the applied reseaich reported here. 
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